翻訳と辞書
Words near each other
・ Nokia
・ Nokia (disambiguation)
・ Nokia 100
・ Nokia 101
・ Nokia 1011
・ Nokia 103
・ Nokia 105
・ Nokia 106
・ Nokia 1100
・ Nokia 1110
・ Nokia 1112
・ Nokia 1202
・ Noisy scrubbird
・ Noisy Stylus
・ Noisy text
Noisy text analytics
・ Noisy Water Winery
・ Noisy-channel coding theorem
・ Noisy-Diobsud Wilderness
・ Noisy-le-Grand
・ Noisy-le-Roi
・ Noisy-le-Sec
・ Noisy-le-Sec (Paris RER)
・ Noisy-Rudignon
・ Noisy-storage model
・ Noisy-sur-Oise
・ Noisy-sur-École
・ Noita
・ Noita (album)
・ Noita palaa elämään


Dictionary Lists
翻訳と辞書 辞書検索 [ 開発暫定版 ]
スポンサード リンク

Noisy text analytics : ウィキペディア英語版
Noisy text analytics
Noisy text analytics is a process of information extraction whose goal is to automatically extract structured or semistructured information from noisy unstructured text data. While Text analytics is a growing and mature field that has great value because of the huge amounts of data being produced, processing of noisy text is gaining in importance because a lot of common applications produce noisy text data. Noisy unstructured text data is found in informal settings such as online chat, text messages, e-mails, message boards, newsgroups, blogs, wikis and web pages. Also, text produced by processing spontaneous speech using automatic speech recognition and printed or handwritten text using optical character recognition contains processing noise. Text produced under such circumstances is typically highly noisy containing spelling errors, abbreviations, non-standard words, false starts, repetitions, missing punctuations, missing letter case information, pause filling words such as “um” and “uh” and other texting and speech disfluencies. Such text can be seen in large amounts in contact centers, chat rooms, optical character recognition (OCR) of text documents, short message service (SMS) text, etc. Documents with historical language can also be considered noisy with respect to today’s knowledge about the language. Such text contains important historical, religious, ancient medical knowledge that is useful. The nature of the noisy text produced in all these contexts warrants moving beyond traditional text analysis techniques.
== Techniques for noisy text analysis ==
Missing punctuation and the use of non-standard words can often hinder standard natural language processing tools such as part-of-speech tagging
and parsing. Techniques to both learn from the noisy data and then to be able to process the noisy data are only now being developed.

抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)
ウィキペディアで「Noisy text analytics」の詳細全文を読む



スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.